A Geometrical Approach to Topic Model Estimation
Author
Abstract
In probabilistic topic models, the quantity of interest, a low-rank matrix consisting of topic vectors, is hidden in the text corpus matrix and masked by noise, and the Singular Value Decomposition (SVD) is a potentially useful tool for learning such a low-rank matrix. However, the connection between this low-rank matrix and the singular vectors of the text corpus matrix is usually complicated and hard to spell out, so using SVD to learn topic models is challenging. In this paper, we overcome the challenge by revealing a surprising insight: there is a low-dimensional simplex structure that can be viewed as a bridge between the low-rank matrix of interest and the SVD of the text corpus matrix, and that allows us to conveniently reconstruct the former using the latter. This insight motivates a new SVD approach to learning topic models, which we analyze with delicate random matrix theory, deriving its rate of convergence. We support our methods and theory numerically, using both simulated data and real data.
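The abstract does not spell out how the simplex arises, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It builds a noiseless rank-K corpus matrix from a hypothetical topic matrix `A` and weight matrix `W`, assumes each topic has one anchor word (a word that appears in that topic only), and divides the leading left singular vectors entrywise by the first one. That ratio construction, the matrix sizes, and all variable names are illustrative assumptions. In this noiseless setting every word then maps to a point inside the simplex whose vertices are the anchor-word points, which the code checks through barycentric coordinates.

```python
import numpy as np

rng = np.random.default_rng(0)
p, n, K = 30, 200, 3  # hypothetical sizes: words, documents, topics

# Topic matrix A (p x K): column k is topic k's distribution over words.
A = rng.random((p, K))
A[:K] = np.eye(K)      # assumption: word k is an anchor word for topic k
A /= A.sum(axis=0)     # normalise each topic column to sum to 1

# Weight matrix W (K x n): column i holds document i's topic proportions.
W = rng.dirichlet(np.ones(K), size=n).T

D0 = A @ W             # noiseless expected word-frequency matrix, rank K

U, s, _ = np.linalg.svd(D0, full_matrices=False)
Xi = U[:, :K]          # leading K left singular vectors

# Divide singular vectors 2..K entrywise by the first one (assumed construction).
R = Xi[:, 1:] / Xi[:, [0]]   # p x (K-1), one point per word

# Barycentric coordinates of every point relative to the anchor-word points.
vertices = R[:K]                           # K x (K-1)
M = np.vstack([vertices.T, np.ones(K)])    # K x K system matrix
rhs = np.vstack([R.T, np.ones(p)])         # K x p right-hand sides
weights = np.linalg.solve(M, rhs).T        # p x K, each row sums to 1

print("trailing singular values vanish:", s[K] < 1e-10 * s[0])
print("smallest barycentric weight:", weights.min())
```

Non-negative barycentric weights mean each point is a convex combination of the anchor-word points. With real, noisy word counts the points would only lie near such a simplex, and the vertices would have to be estimated rather than read off from known anchor words.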
Similar resources
Application of Model-Based Estimation to Time-Delay Estimation of Ultrasonic Testing Signals
Time-Delay-Estimation (TDE) has been a topic of interest in many applications in the past few decades. The emphasis of this work is on the application of model-based estimation (MBE) for TDE of ultrasonic signals used in ultrasonic thickness gaging. Ultrasonic thickness gaging is based on precise measurement of the time difference between successive echoes which reflect back from the back wall ...
Estimation of Return to Scale under Weight Restrictions in Data Envelopment Analysis
Return-To-Scale (RTS) is one of the most important topics in DEA. Not many methods have yet been obtained for estimating RTS in DEA. This paper develops the Banker-Thrall approach to identify the RTS situation for the BCC model in "multiplier form" with virtual weight restrictions that are imposed on the model by DM judgments. Imposing weight restrictions on DEA models has often created the problem of infeasibili...
A HYBRID SUPPORT VECTOR REGRESSION WITH ANT COLONY OPTIMIZATION ALGORITHM IN ESTIMATION OF SAFETY FACTOR FOR CIRCULAR FAILURE SLOPE
Slope stability is one of the most complex and essential issues for civil and geotechnical engineers, mainly because of the loss of life and the high economic losses that result from slope failures. In this paper, a new approach is presented for estimating the Safety Factor (SF) for circular failure slope using hybrid support vector regression (SVR) and Ant Colony Optimization (ACO). The ACO is combined with the S...
The progress mechanism of track geometrical irregularity focusing on hanging sleepers
This topic is very traditional and has been of concern for a long time. Many sophisticated vehicle/track interaction models and track dynamic deterioration models have been developed so far. The author also developed a track settlement progress model comprising a vehicle/track interaction model and a track settlement law. Hanging sleepers are usually caused more or less by the dyn...
Bayesian Hybrid Model-State Estimation Applied to Simultaneous Contact Formation Recognition and Geometrical Parameter Estimation
This paper describes a Bayesian approach to model selection and state estimation for sensor-based robot tasks. The approach is illustrated with a hybrid model-state estimation example from force-controlled autonomous compliant motion: simultaneous (discrete) Contact Formation recognition and estimation of (continuous) geometrical parameters. Previous research in this area mostly tries to solve ...
Hammerstein-Wiener Model: A New Approach to the Estimation of Formal Neural Information
A new approach is introduced to estimate the formal information of neurons. Formal information mainly concerns the aspects of the response that are related to the stimulus. Estimation is based on introducing a mathematical nonlinear model with a Hammerstein-Wiener system estimator. This method of system identification consists of three blocks to completely describe the nonlinearity of inp...
Journal title:
- CoRR
Volume abs/1608.04478, Issue
Pages -
Publication date 2016